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Abstract 

In  recent  empirical  studies  we  have  shown  that  many  interesting  cost-based  abduction  problems 
can  be  solved  efficiently  by  considering  the  linear  program  relaxation  of  their  integer  program 
formulation.  We  tie  this  to  the  concept  of  total  unimodularity  from  network  flow  analysis,  a  fun¬ 
damental  result  in  polynomial  solvability.  From  this,  we  can  determine  the  polynomial  solvability 
of  abduction  problems  and,  in  addition,  present  a  new  heuristic  for  branch  and  bound  in  the 
non-polynomial  cases. 


1.  Introduction 

In  cost-based  abduction  [3,9],  hypotheses  have  associated  costs,  and  the  cost  of 
a  proof  is  simply  the  sum  of  the  costs  of  the  hypotheses  required  to  complete  that 
proof.  Examples  of  such  proofs  can  be  found  in  [1,3,9],  Central  to  this  approach  is 
the  use  of  directed  acyclic  graphs  called  WAODAGs  (or,  weighted  and/or  directed 
acyclic  graphs)  to  represent  relationships  between  hypotheses  and  the  evidence  to  be 
explained.  Each  node  represents  some  proposition,  and  the  connections  explicitly  detail 
the  relationships  between  different  propositions.  It  has  been  shown  in  [3]  that  belief 
revision  in  Bayesian  networks  [7]  can  be  accurately  modeled  by  cost- based  abduction. 
The  costs  are  interpreted  as  negative  log  probabilities.  Unfortunately,  computing  the 
minimal  cost  explanation  has  also  been  shown  to  be  NP-hard  [3]. 
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Recent  empirical  [2,9]  studies  have  shown  that  many  interesting  cost-based  abduc¬ 
tion  problems  can  be  solved  efficiently  performing  better  than  existing  best-first  search 
techniques  by  considering  the  linear  program  relaxation  of  their  integer  program  for¬ 
mulation.  In  particular,  a  significant  number  of  test  cases  [2,9]  were  solved  through 
linear  programming  alone  without  resorting  to  branch  and  bound.  Linear  programming 
is  known  to  be  solvable  in  polynomial  time  [6. 10]. 

We  study  this  phenomenon  and  determine  the  conditions  for  solving  abduction  prob¬ 
lems  in  polynomial  time.  Our  approach  ties  the  concept  of  total  unimodularity  from 
network  flow  analysis  to  our  cost-based  problem.  Total  unimodularity  is  a  fundamental 
result  in  optimization  on  polynomial  solvability  [6,10].  Furthermore,  for  those  ab¬ 
duction  problems  requiring  branch  and  bound,  we  present  a  new  heuristic  based  on 
our  results  which  potentially  reduces  the  overall  number  of  branches  that  need  to  be 
explored. 


2.  Solvability 

Cost-based  abduction  problems  are  defined  as  directed  graphs,  called  waodaGs  (or, 
weighted  OR /AND  directed  acyclic  graphs),  where  the  nodes  represent  propositions, 
the  connections  represent  logical  relationships,  and  the  weights  at  each  node  represent 
the  costs  [3,9].  Semantically,  these  costs  can  be  treated  as  the  negative  logarithms  of 
conditional  probabilities  as  shown  in  [3]. 

Notation.  R+  denotes  the  positive  reals. 

Definition  2.1.  A  WAODAG  is  a  4-tuple  (G,c.r,S),  where: 

( 1 )  G  is  a  directed  acyclic  graph,  G  =  ( V.  E) . 

(2)  c  :  V  —>  K+  is  called  the  cost  function. 

(3)  r  :  V  — *  {AND, or}  is  called  the  label.  Such  nodes  are  called  AN D-nodes  and 
OR -nodes  respectively. 

(4)  SC  V  is  called  the  evidence  nodes. 

Notation.  For  each  node  q  6  V,  Dq  =  {p  |  (p.q)  €  E},  the  parents  of  q. 

Notation.  |  •  |  denotes  the  cardinality  of  a  set. 

An  explanation  of  the  evidence  is  the  same  as  a  proof.  More  formally,  we  define  this 
as  follows: 

Definition  2.2.  A  truth  assignment  for  a  WAODAG  W  —  ( G,c,r,S )  where  G  =  (  V.  E) 
is  a  function  e  :  V  — >  {true,  false}.  We  say  that  such  a  function  is  an  explanation  for 
W  if  and  only  if  the  following  conditions  hold: 

( 1 )  Vq  e  V  s.t.  r{q)  =  and,  e(q)  =  true  =>  Vp  e  V  s.t.  (p, q)  6  E,e(p)  =  true. 

(2)  Vp  G  V  s.t.  r(q)  =  OR ,e(q)  =  true  =>  3p  €  V  s.t.  ( p,q )  S  E  and  e(p)  =  true. 

(3)  Vi q  £  S,  e(q)=true. 
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Fig.  1.  A  simpler  waodag.  The  AND-node  house-quiet  is  the  observation.  The  nodes  no-shows, 
no-one-home  and  bad-songs  are  the  hypotheses  with  associated  costs  6,  7  and  3,  respectively. 

To  order  the  possible  explanations,  we  define  the  cost  of  an  explanation  as  follows: 

Definition  2.3.  The  cost  C  of  an  explanation  e  for  W  =  ( G,c,r,S )  where  G  =  ( V,  E) 
is 

C(e)=  ]T  c(q).  (1) 

{<?€V|e(<7)=true} 

An  explanation  e  which  minimizes  C  is  called  a  best  explanation  for  W. 

Note  that  conditions  (1)  and  (2)  in  Definition  2.2  above  are  relaxed  from  the  original 
definition  found  in  [9],  According  to  Theorems  2.14  and  2.15  from  [9],  the  best 
explanation  defined  here  can  be  transformed  into  a  best  explanation  for  the  original 
definition.  Such  a  transformation  can  be  achieved  in  0(|£|)  steps. 

In  Fig.  1,  assume  that  house-quiet  =  true  is  the  observation  to  be  explained.  One 
possible  explanation  would  be  the  following  assignment  of  truth  values:  {house-quiet, 
radio-off,  bad-songs,  tv-off,  no-shows}  are  assigned  true  and  {no-one-home}  is  assigned 
false.  A  second  possibility  is  to  assign  all  of  them  to  true.  And  as  a  third  possibility, 
we  can  assign  {house-quiet,  radio-off,  tv-off,  no-one-home}  to  true  and  {bad-songs, 
no-shows}  to  false.  We  find  that  our  three  explanations  above  have  costs  9,  16  and  7, 
respectively.  Of  the  three,  our  best  explanation  is  the  third  one  with  the  cost  of  7. 

Given  a  WAODAG  W  =  ( G,c,r,S )  where  G  =  (YE),  construct  a  0-1  integer  linear 
program,  L(W),  as  follows:  The  real  variables  of  L(W)  is  the  set  of  variables  indexed 
by  V,  that  is,  {xq  \  q  G  V}.  These  variables  are  governed  by  the  constraints 

•  For  each  xq, 

(1)  if  r(q)  =  AND,  then 

Xq  ^  Xp  (2) 


for  each  p  G  Dq\ 
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(2)  if  r(q)  =  OR,  then 

xp  ^  xq.  (3) 

•  For  each  q  £  S,  xq  -  1 . 

•  For  each  xq,  xq  is  either  0  or  1 . 

The  objective  function  being  minimized  is 

&UW)  =  'Y^xqc{q)- 

q€V 

Note:  We  have  only  taken  a  subset  of  the  constraints  from  the  original  formulation  [9] 
for  our  transformation.  In  particular,  the  top-down  constraints  in  [9]  are  not  considered. 2 
Again,  according  to  Theorems  2.14  and  2.15  from  [9],  this  system  is  sufficient  to  find 
a  best  explanation  for  W. 

Theorem  2.4.  An  optimal  solution  for  L(W)  is  a  best  explanation  for  VT. 

For  Fig.  1,  we  have  the  following  system  of  linear  inequalities: 


•^house-quiet  -^tv-ofl ■ 

^house-quiet  Aradio-off, 

-^house-quiet  =  1- 

V bad-songs  A no-one-home  h'  A tv-off' 

Ano-shows  T  -Vno_one_home  hS  Aradio-off, 

and  the  following  objective  function 

@L(W)  ~  oX bad-songs  T  7xno_one-home  T  bAno- shows  ■ 

With  the  transformation  to  integer  linear  programming,  we  now  examine  the  total 
unimodularity  [6,  10]  of  the  constraint  system.  Without  loss  of  generality,  assume  that 
W  is  a  binary  graph,  that  is,  each  node  has  either  0  or  2  parents  only.  Any  waodag 
can  be  directly  transformed  into  a  binary  graph  with  at  most  a  linear  increase  in  the 
number  of  nodes  and  edges. 

As  mentioned  in  [9],  the  optimal  solution  to  L(W)  may  not  be  integral  (or  more 
specifically,  0-1).  Consider  the  simple  waodag  in  Fig.  2.  From  this  WAODAG,  we 
have  the  following  system  of  linear  inequalities: 


Afv-off  T  Aradio-off  f  Ahouse-quiet , 
Ahouse-quiet  =  1  , 


Atv-off  Ano-one-home , 
Aradio-off  Ano-one-home , 


2  This  corresponds  to  the  semi-induced  constraint  system  of  1 9 1 . 
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Fig.  2.  A  waodag  with  a  non-integral  optimal  for  L(W). 
and  the  following  objective  function 

@L(W)  =  7xno.0ne-home  • 

We  can  easily  show  that  the  solution  which  minimizes  the  objective  function  is  as 
follows,  if  house-quiet  =  1*  radio-off  =  0.5,  Xtv-off  =  0.5,  and  Xno-one-home  =  0.5  with 
@L(W)  =  3.5.  The  primary  cause  for  the  0.5s  arises  from  the  inequality  for  the  OR- 
node  house-quiet. 

Consider  the  following  canonical  form  optimization  problem: 
min  cx 

X 

s.t.  Ax  <  b,  (4) 

X[  is  0  or  1. 

The  Xj  correspond  to  our  xq  where  each  q  is  a  node  above,  x  is  called  the  solution 
vector.  Each  c,  in  c  corresponds  to  the  cost  of  each  node  represented  by  x,  and  c  is 
called  the  cost  vector.  Finally,  we  can  rewrite  our  above  waodag  linear  inequalities 
as  a  matrix  and  vector  multiplication  where  A  is  called  the  constraints  matrix  and  b  the 
constraints  vector. 

Observe  that  each  column  in  the  matrix  A  corresponds  to  a  unique  node  in  W  and 
vice  versa.  For  Fig.  1,  we  have  the  following  system: 

^house-quiet 
X  tv-  off 
•fradio-off 

X  = 

Xbad-songs 
X  no-one-home 
X  no-shows 
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Next,  we  define  the  concept  of  parity  as  follows: 

Definition  2.5.  Given  an  undirected  simple  cycle, 

T  =  {(P\,Pl),  (P2,Pl), - (P„-\,P\)}, 

in  G,  the  parity  of  r  is 

| {yo  £  V  |  r(p)  =  OR,  (p,p)€T,  (p,q)  £  t,  and  Dp  =  {p,q}}\  modulo  2.  (5) 

Furthermore,  if  the  parity  equals  1 ,  then  r  is  an  odd-cycle.  Otherwise,  it  is  an  even-cycle. 

Intuitively,  parity  is  a  measure  on  the  number  of  OR-nodes  and  their  parents  that  are 
traversed  by  the  cycle.  We  now  extend  this  over  W. 

Definition  2.6.  W  is  parity-balanced  if  for  any  undirected  simple  cycle  t  in  G,  r  is  an 
even-cycle. 

Notation.  For  simplicity,  we  can  also  represent  r  by  the  sequence  of  nodes  {p\,pi, . . . , 
pn}  where  p„  =  p\. 

Definition  2.7.  A  is  said  to  be  totally  unimodular  if  all  of  its  square  submatrices  have 
determinant  equal  to  either  0,  +1,  or  -1. 

Consider  the  following  theorem  on  total  unimodularity  from  [  10,  Theorem  19.3(iv)  ] : 

Lemma  2.8.  A  is  totally  unimodular  iff  each  collection  of  columns  of  A  can  be  split 
into  two  parts  so  that  the  sum  of  the  columns  in  one  part  minus  the  sum  of  the  columns 
in  the  other  part  is  a  vector  with  entries  only  0,  +1 ,  and  —1. 

Based  on  our  derivation  of  A  from  W,  the  entries  in  A  are  either  0,  +1,  or  -1. 
Furthermore,  for  any  submatrix  A'  formed  by  arbitrarily  selecting  sets  of  columns  from 
A,  each  row  in  A'  will  have  one  of  the  following  configuration  of  non-zeros: 

( 1 )  All  zeros. 

(2)  One  ±1. 

(3)  One  +1  and  one  —1. 

(4)  Two  —  Is. 

(5)  One  +1  and  two  —  Is. 
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With  regards  to  Lemma  2.8  on  columns  splitting,  the  first  two  configurations  will  have  no 
effect.  The  third  configuration  requires  that  the  two  columns  involved  must  belong  in  the 
same  grouping,  otherwise,  the  results  for  that  row  will  be  ±2.  The  fourth  configuration 
requires  that  the  two  columns  be  separated.  The  last  configuration  prohibits  placing  the 
+  1  column  in  one  group  and  both  the  -  Is  in  the  other  group. 

Intuitively,  Lemma  2.8  represents  a  dependency  graph  between  the  columns  in  the 
matrix.  As  long  as  this  graph  is  bipartite,  we  can  partition  up  the  columns  to  satisfy  the 
lemma.  Our  goal  is  to  show  that  parity-balance  guarantees  a  bipartite  graph  and  vice 
versa.  We  now  prove  the  following  theorem  on  total  unimodularity: 

Theorem  2.9.  A  is  totally  unimodular  iff  W  is  parity -balanced. 

Proof.  ( => )  Let  A  be  totally  unimodular  and  assume  that  W  is  not  parity-balanced.  This 
implies  that  there  exists  an  odd-cycle  r  in  G.  Let  r  =  {pi  ,p2, . . .  ,p„}  where  pn  =  p\ . 
An  OR-node  p  is  said  to  be  completed  in  r  if  it  contributes  to  the  parity  of  r,  i.e., 
r(p)  =  OR,  ( p,p )  G  r,  {p,q)  G  r,  and  Dp  =  {p,  q}.  The  number  of  completed  nodes 
in  t  is  odd. 

Rewrite  r  as  follows: 

{<?l.Pl,l>Pl,2,  •  •  •  ,P1,»I1»<?2,P2,1.  •  •  •  » P2,ni2  >  *73  >  •  •  -  ,qk>Pk,  1 ,  •  •  ■  ,Pk,mk,q\} 

where  {<71, . . .  ,<7*}  are  all  the  completed  OR-nodes  in  r.  Let  atJ  be  the  column  in  A 
associated  with  pij  for  j  =  1 , . . . ,  m,  and  i  =  1 , . . . ,  k.  Form  a  collection  of  columns 
with  this  set  and  call  the  matrix  A'.  From  Lemma  2.8,  if  the  columns  in  A'  can  be 
split,  then  the  columns  of  any  submatrix  of  A'  formed  by  removing  rows  from  A'  can 
also  be  split.  There  may  be  OR-nodes  in  r  whose  parents  are  both  also  in  r  but  is  not 
completed.  Eliminate  the  rows  relating  such  OR-nodes  to  their  parents,  i.e.,  those  from 
inequality  (3).  Now,  the  remaining  rows  in  A'  can  be  in  any  configuration  except  the 
fifth  one  as  described  earlier. 

Claim.  For  each  i  =  1 , . . . ,  k,  all  Pij  for  j  =  1 , . . . ,  m,-  must  belong  in  the  same  group. 

Consider  any  two  consecutive  nodes  and  Pij+\  in  r.  One  is  necessarily  the  parent 
of  the  other.  Without  loss  of  generality,  say  pij  is  the  parent.  If  p,-,;+i  is  an  AND-node, 
then  there  is  a  row  in  A'  that  satisfies  configuration  (3)  with  the  +1  in  a,J+i  and  -1 
in  Oij.  Similarly,  this  holds  even  if  Pij+\  is  an  OR-node,  since  it  cannot  be  a  completed 
node.  This  implies  that  the  two  nodes  must  belong  to  the  same  group  since  the  columns 
must  belong  in  the  same  part  in  order  to  satisfy  Lemma  2.8. 

Denote  these  groups  simply  by  P,. 

Claim.  Pi  can  never  be  grouped  with  P*+ 1  for  i  =  1  1  and  P*  cannot  be  grouped 

with  P\ . 

For  each  completed  node  <7,,  and  p,,i  are  the  parents  of  <7,  for  i  =  2, . . . ,  k , 

and  for  <71,  the  parents  are  pk,mt  and  p\:\.  Also,  there  must  exist  a  row  of  configuration 
(4)  above  such  that  the  two  —Is  correspond  to  the  parents.  This  implies  that  these  two 
nodes  must  always  be  in  separate  groups  to  satisfy  Lemma  2.8.  Thus,  P,  can  never  be 
grouped  with  P,+ 1  for  i  =  1 .....  A:  —  1  and  P*  cannot  be  grouped  with  P\ . 
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For  k  =  1,  we  have  an  inconsistency  where  p u  and  pi>mi  must  both  be  in  the  same 
group  as  well  as  both  be  separated. 

For  k  >  1,  this  leaves  us  with  the  problem  of  merging  the  P,  until  we  only  have 
two  groups  while  simultaneously  satisfying  the  grouping  and  separation  constraints.  We 
can  represent  this  by  an  undirected  graph  A  whose  nodes  correspond  to  the  P,  and  arcs 
between  the  nodes  correspond  to  a  separation  condition.  Our  problem  now  reduces  to 
whether  A  is  bipartite. 

Clearly,  A  is  a  simple  cycle  of  odd  length  since  the  number  of  completed  nodes  is 
odd.  Thus,  A  cannot  be  bipartite  [5],  Contradiction. 

(4=)  Let  W  be  parity-balanced  and  let  {a\,a2, . . .  ,a„}  be  any  collection  of  columns 
in  A.  Let  A'  be  the  submatrix  of  A  formed  from  the  a,. 

Let  V'  =  {p\  ,p2, . . .  ,pn}  be  the  nodes  associated  with  the  columns  a,.  Construct  the 
subgraph  G'  —  ( V',E ')  from  G  as  follows:  For  all  i,  j  from  1  to  n,  if  ( PhPj )  £  E,  then 
(Pi,Pj)  £  E' . 

We  say  an  OR-node  is  weakly  completed  with  respect  to  G'  if  all  its  parents  are 
also  in  G! .  We  observe  the  following:  Given  ( p,,p; )  £  E\  (i)  r(pj)  =  AND  or  (ii) 
r(pj)  =  OR  and  p ,•  is  not  weakly  completed  if  and  only  if  there  exists  exactly  one  row 
in  A'  of  configuration  (3)  such  that  the  non-zeros  are  only  in  a,  and  aj. 

Define  a  relation  R  on  V  such  that  p,Rp,  iff  i  =  j  or  there  exists  an  undirected  path 
t  from  pj  to  pj  where  r  does  not  contain  any  weakly  completed  OR-node.  Clearly,  R  is 
an  equivalence  relation  and  let  a  =  {er|,er2,. . .  ,<rk}  be  the  partition  on  V'  induced  by 
R.  Intuitively,  these  ak  represent  the  necessary  grouping  of  nodes/columns  to  guarantee 
total  unimodularity. 

Next,  construct  an  undirected  graph  A  whose  nodes  correspond  to  each  a k.  For  each 
row  of  configuration  (4)  on  A',  if  the  -  Is  occur  in  a,  and  a,,  then  connect  cr*.  to 
where  pt  £  <rki  and  p-,  £  akj. 

Claim.  <rk  is  never  directly  connected  to  itself. 

If  both  pi  and  pf  belong  to  crk,  then  there  is  an  undirected  path  from  p,  to  pj 
containing  no  weakly  completed  OR-nodes.  By  construction  of  A  from  W,  pi  and  are 
the  parents  of  some  OR-node  q  outside  of  V' .  We  can  construct  a  cycle  by  including 
q.  By  Definition  2.5,  this  is  an  odd-cycle.  This  would  contradict  our  assumption  of 
parity-balance  for  W. 

Finally,  we  must  consider  rows  of  configuration  5  in  A' .  Such  rows  indicate  weakly 
completed  OR-nodes  in  G' .  Assume  p,  is  such  a  node  and  p7l  and  pn  are  the  parents  of 
Pi- 

Claim.  Both  ph  and  pn  cannot  belong  in  the  same  partition  ak. 

Otherwise,  there  is  an  undirected  path  from  p-n  to  pr;  containing  no  weakly  completed 
OR-nodes.  We  can  construct  a  cycle  in  the  path  by  including  p,-.  By  Definition  2.5,  this 
is  an  odd-cycle.  This  would  contradict  our  assumption  of  parity-balance  for  W. 

Since  the  parents  of  a  weakly  completed  OR-node  cannot  be  in  the  same  partition, 
place  an  arc  in  A  between  the  corresponding  partitions  containing  the  parents. 

As  before,  we  must  now  prove  that  A  is  bipartite.  Assume  that  A  has  an  odd-length 
cycle.  Each  arc  in  A  corresponds  to  an  OR-node  in  G.  We  can  construct  an  undirected 
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cycle  in  G  by  first  constructing  path  segments  which  are  free  of  weakly  completed 
OR-nodes  in  the  nodes  of  A  being  traversed  and  join  them  via  the  OR-nodes  represented 
by  the  arcs.  Hence,  we  have  an  odd-cycle  in  G.  Contradiction.  Therefore,  A  is  bipartite 
and  A'  satisfies  the  columns  splitting  of  Lemma  2.8.  □ 

The  Simplex  Method  for  solving  linear  programs  searches  along  the  vertices  of  the 
polyhedron  defined  by  the  constraint  system.  Combined  with  the  following  theorem  from 
[10,  Theorem  19.3(H)]: 

Theorem  2.10.  A  is  totally  unimodular  iff  for  each  integral  vector  b,  the  polyhedron 
{x  |  x  >  0,  Ax  ct  b}  has  only  integral  vertices. 

This  guarantees  that  the  optimal  solution  found  by  Simplex  will  be  integral.  Hence, 
Simplex  alone  will  solve  WAODAG  W  if  W  is  parity-balanced  without  resorting  to 
branch  and  bound. 

However,  Simplex  has  an  exponential  worst-case  run-time.  Linear  programming  prob¬ 
lems  can  be  solved  in  polynomial  time  through  methods  such  as  Khachiyan’s  [10] 
but  these  methods  do  not  necessarily  find  an  optimal  solution  which  is  integral.  It  is 
possible  for  there  to  exist  an  integral  and  a  non-integral  solution  both  of  which  are 
optimal  (same  objective  value).  Simplex  searches  for  an  optimal  solution  by  exploring 
the  extreme  points  of  the  constraint  space  [4,6].  Total  unimodularity  guarantees  that 
all  extreme  points  are  integral.  Hence,  Simplex  will  find  an  integral  optimal.  As  it  turns 
out,  taking  the  solution  generated  with  Khachiyan’s  method,  the  optimal  integral  solution 
can  be  found  in  polynomial  time  when  W  is  parity-balanced.  This  follows  from  [10, 
Theorem  16.2]. 

Let’s  consider  two  simple  WAODAGs  in  Figs.  3  and  4  where  the  first  is  of  even  parity 
and  the  other  is  of  odd  parity.  Let 


x  = 


■^house-quiet 

-^tv-off 

radio-off 


•^no-one-home 


The  corresponding  constraint  matrices  are 


^even  ~ 


'  1 

-1 

0 

o' 

_ 

0 

i 

-1 

-1 

0 

0 

1 

-1 

0 

?  ^odd  — 

0 

1 

-1 

0 

t 

0 

-1 

0 

0 

1 

-1 

0 

0 

1 

-1. 

According  to  Lemma  2.8,  we  find  that  taking  the  last  three  columns  of  Aod(i,  there  is 
no  way  to  split  them  into  two  sets  to  satisfy  the  lemma.  On  the  other  hand,  Aeven  does 
indeed  satisfy  the  conditions  for  total  unimodularity. 
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no-one-home-7 


Fig.  3.  A  simple  even  parity  waodag. 


Fig.  4.  A  simple  odd  parity  waodag. 


Proposition  2.11.  Cost-based  abduction  with  parity-balanced  WAODAG ^  can  be  solved 
in  polynomial  time. 

The  total  unimodularity  of  a  matrix  can  be  determined  in  polynomial  time  according 
to  [10,  Theorem  20.3].  Hence,  we  can  determine  parity-balance  also  in  polynomial 
time.  An  alternative  to  this  approach  is  to  check  for  parity-balance  directly  for  each 
undirected  cycle  in  the  graph.  The  problem  of  generating  the  cycles  can  be  easily  done 
in  0(  ( |  V|  + 1 E\ )  ( n  +  1 ) )  where  n  is  the  number  of  cycles  generated.  Basically,  a  depth- 
first  search  is  used  whereby  through  an  elegant  approach  to  adding  edges  will  generate 
each  cycle.  Details  and  analysis  of  the  algorithm  can  be  found  in  [8], 
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Now,  consider  the  case  when  the  constraint  system  is  not  totally  unimodular.  The 
vertex  corresponding  to  the  optimal  solution  in  this  polyhedron  may  still  be  integral. 

Let  n  be  the  number  of  variables  and  m  be  the  number  of  equations  in  our  constraint 
system.  For  each  vertex  v  in  our  polyhedron,  there  are  n  constraints  which  define  v. 
Construct  a  subgraph  Gv  from  G  as  follows: 

•  For  each  inequality  like  (3),  we  include  the  OR-node,  its  parents  and  the  edges 
between  them. 

•  For  each  inequality  like  (2),  we  include  the  AND-node,  the  single  parent  involved 
and  the  edge  between  them. 

Constraints  for  evidence  are  ignored  since  they  will  have  no  impact  on  total  unimodu¬ 
larity  of  the  resultant  matrix.  (The  row  associated  with  such  a  constraint  has  a  single 
non-zero  value  of  1.)  Parity  and  parity-balanced  can  be  naturally  extended  to  Gv  even 
though  AND-nodes  may  only  have  one  parent. 

Theorem  2.12.  If  G,  is  parity-balanced ,  then  v  is  integral. 

Proof.  If  Gv  is  parity-balanced,  then  it  follows  with  slight  modifications  from  Theo¬ 
rem  2.9,  that  the  coefficient  matrix  of  the  n  equations  is  totally  unimodular.  Thus,  v  is 
integral.  □ 

The  integrality  of  a  vertex  depends  on  its  associated  subgraphs.  Hence,  linear  pro¬ 
gramming  alone  can  also  result  in  an  integral  optimal  solution. 

Let  W'  -  (G1  ,c'  ,r'  ,S')  where  G'  =  (V',E')  be  a  WAODAG. 

Definition  2.13.  W’  is  said  to  be  a  partial  WAODAG  of  W  if  the  following  conditions 
hold: 

•  V1  =  V. 

•  G'  is  a  subgraph  of  G  such  that  Vp  E  V,  if  r(p)  =  OR  then  either 

-  \/(q,  p)  E  E,  ( q,p )  E  E' ,  or 

-  V(g,p)  E  E,  ( q,p )  is  not  in  £'. 

•  c'  =  c. 

•  r'  =  r. 

•  S'  =  S. 

Theorem  2.14.  Given  a  WAODAG  W,  let  e  be  a  best  explanation  for  W.  We  can 
determine  a  best  explanation  for  W  in  polynomial  time  if  there  exists  a  partial  WAODAG 
W'  of  W  such  that  W'  is  parity-balanced  and  e  is  also  a  best  explanation  for  W' . 

Proof.  Observe  that  the  polytope  for  W'  is  a  superset  of  the  polytope  for  W.  In  fact, 
the  matrix  for  W'  is  a  submatrix  of  the  one  for  W.  Furthermore,  since  e  is  also  a 
best  explanation  for  W' ,  there  cannot  exist  any  solution  x  in  W  such  that  cx  <  C(e) 
regardless  of  whether  x  is  integral  or  non-integral. 

With  Khachiyan’s  method,  we  can  find  C{e)  =  max{cx  |  Ax  ^  b\  and  a  system  of 
linear  equations  A'x  =  b'  determining  a  minimal  face  F  of  {or  [  Ax  f  b}  so  that  each 
x  in  F  satisfies  cx  =  C(e)  [10]. 
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We  can  find  an  optimal  integral  solution  tor  W  in  polynomial  time  according  to 
Corollary  5.3b  from  [  10]  which  states  that:  Given  a  system  of  rational  linear  equations, 
we  can  decide  if  it  has  an  integral  solution,  and  if  so,  find  one,  in  polynomial  time.  □ 

From  Theorem  2.14.  it  is  not  necessary  for  the  original  graph  to  be  parity-balanced 
to  ensure  that  an  optimal  solution  to  the  derived  constraint  system  is  integral.  As 
long  as  there  exists  a  partial  WAODAG  which  is  parity-balanced  and  shares  the  same 
optimal  integral  solution  as  the  original.  Simplex  alone  can  be  used  to  determine  a  best 
explanation.  This  can  serve  to  explain  the  empirical  results  in  [2,9]. 

Finally,  given  the  observations  on  parity-balance,  the  clamping  of  evidence  nodes 
in  W  can  also  impact  total  unimodularity  indirectly.  Construct  a  new  waodag  W\  = 
( G',c,r,S )  where  G'  =  ( V,E ')  from  W  as  follows:  Initially,  E'  =  E.  Remove  edge 
e  =  (p,q)  from  E'  if  DqnS  is  not  empty  and  r{q)  =  OR.  Intuitively,  q  is  automatically 
supported  by  one  of  its  parents. 

Proposition  2.15.  The  best  explanation  for  VV |  is  the  best  explanation  for  W. 

W\  reduces  the  size  of  the  problem  and  eliminates  potential  odd  parity  cycles  from  W. 
More  importantly,  this  generalizes  to  the  branch  and  bound  process  further  eliminating 
such  cycles. 

In  the  bounding  process,  a  variable  xq  can  be  clamped  to  either  0  or  1 .  Proceeding 
down  the  branch  and  bound  tree,  each  problem  instance  has  two  associated  sets  of 
nodes  Vo  and  V]  3  S  which  must  be  clamped  to  0  and  1  respectively.  Let  V0  be  defined 
recursively  as  follows: 

•  V0  C%.  _  _ 

•  If  r(q)  -  AND  and  Dq  D  Vf)  is  not  empty,  then  q  £  Vo. 

•  If  r(q)  =  OR  and  Dq  C  V0,  then  q  £  Vo. 

If  Vb  FI  V\  is  not  empty,  the  problem  instance  is  infeasible.  Otherwise,  construct  W' 
from  W  as  follows: 

( 1 )  For  each  p  £  Vo,  remove  all  edges  incident  on  p  from  G. 

(2)  If  Dq  fl  V)  is  not  empty  and  r(q)  =  OR,  remove  edge  e  =  ( p,q ). 

(3)  If  Dq  C  V i  and  r(q)  =  AND,  remove  edge  e  =  (p,q). 

Definition  2.16.  e  is  a  consistent  explanation  for  W  with  respect  to  Vo  and  Vi  if  the 
following  conditions  hold: 

•  e  is  an  explanation  for  W. 

•  Vq  £  Vo,  e(q)  -  false. 

•  Vq  £  V ,  e(q)  -  true. 

The  best  consistent  explanation  is  one  which  minimizes  the  cost  function  C(  ). 

The  following  relationship  between  W  and  W'  holds: 

Theorem  2.17.  e  is  a  consistent  explanation  for  W  with  respect  to  Vo  and  V\  iff  e  is  a 
consistent  explanation  for  W'  with  respect  to  Vo  and  V] . 
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Proof.  (=>)  Let  e  be  a  consistent  explanation  for  W  with  respect  to  V0  and  V\.  By  the 
construction  of  W'  above  as  a  subgraph  of  W,  e  is  an  explanation  for  W' .  Also,  dq  G  V, , 
e(q)  =  true.  Finally,  from  Definition  2.2  and  the  construction  of  %  above,  \fq  G  %, 
e(q)  =  false.  Therefore,  e  is  a  consistent  explanation  for  W  with  respect  to  %  and  V\. 

(4=)  Let  be  a  consistent  explanation  for  W'  with  respect  to  and  Vj.  Since  V0  is 
a  subset  of  Vfo,  for  all  q  in  To,  e{q)  =  false.  Also,  for  all  <7  in  Vj,  e(q)  =  true. 

If  e'  is  not  an  explanation  for  W,  then  e'  violates  one  of  the  following  conditions 
from  Definition  2.2: 

(1)  3q  G  V  s.t.  r(q)  =  AND,  e(q)  =  true,  and  3 p  G  Dq(W)  s.t.  e(p)  =  false. 
Dq(W )  are  the  parents  of  q  in  W. 

(2)  3<jr  G  V  s.t.  r{q)  =  OR,  e(<7)=true,  and  Vp  G  Dq(W),  e(p)  =  false. 

(3)  3q  G  S  s.t.  e(q)  =  false. 

Case  (1).  Let  p  and  q  be  the  two  nodes.  If  p  G  Dq(W'),  then  e'  cannot  be  an 
explanation  for  W' .  If  p  is  not  in  Dq(W'),  by  our  above  construction,  p  G  Vo  or  q  G  Vo. 
If  q  G  Vo,  then  e'  cannot  be  a  consistent  explanation  for  VV"  with  respect  to  Vo  and  Vi . 
If  p  G  M>,  then  q  must  also  be  in  Vf) .  Contradiction. 

Case  (2).  Let  q  be  the  node.  If  Dq(W')  is  not  empty,  then  e'  cannot  be  an  explanation 
for  W.  For  each  p  G  Dq(W ),  either  p  G  Vo  or  p  G  Vi.  p  cannot  be  in  V\  since  e(p)  = 
false.  Thus,  Dq(W)  C  Vo  implying  that  q  G  V0.  Contradiction. 

Case  (3).  5  =  5'.  Contradiction. 

Therefore,  e'  is  an  explanation  for  W.  Thus,  e  is  a  consistent  explanation  for  W  with 
respect  to  Vo  and  V\.  □ 

Thus,  W  and  W  are  equivalent  WAODAGs  under  their  respective  clampings. 

Theorem  2.18.  The  best  consistent  explanation  for  W  with  respect  to  Vo  and  V]  is  the 
best  consistent  explanation  for  W  with  respect  to  Vo  and  V\ . 

Proof.  Follows  from  Definition  2.16  and  Theorem  2.17.  □ 

This  leads  to  the  following  new  heuristic  for  branch  and  bound:  Let  Va  C  V  be  the  set 
of  all  OR-nodes  in  G  such  that  the  OR-node  contributes  to  the  parity  of  some  undirected 
simple  cycle  in  G. 

Theorem  2.19.  If  all  the  nodes  in  Va  are  clamped,  then  the  resulting  WAODAG  W' 
constructed  from  W  above  can  be  solved  in  polynomial  time. 

Proof.  If  all  the  nodes  in  V0  are  clamped,  then  there  cannot  exist  any  cycles  in  G' 
constructed  from  G  above  with  odd  parity.  Hence,  W'  is  parity-balanced.  From  The¬ 
orem  2.9,  the  constraint  system  for  W'  is  totally  unimodular  and  thus  solvable  in 
polynomial  time.  □ 

Theorem  2.20.  The  worst-case  number  of  branches  needed  to  find  the  optimal  integral 
solution  is  2^  ^  2'v’. 
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Proof.  Follows  from  our  above  construction  and  Theorem  2.19.  □ 


3.  Conclusion 

Parity-balance  is  a  necessary  and  sufficient  condition  for  guaranteeing  the  total  uni¬ 
modularity  of  a  cost-based  abduction  problem  for  arbitrary  cost  functions.  Thus,  parity- 
balance  also  guarantees  polynomial  solvability  by  using  our  transformation  to  integer 
linear  programming.  Furthermore,  the  integrality  of  each  vertex  in  our  polyhedron  re¬ 
lies  on  the  associated  subgraphs  of  the  cost-based  problem.  Finally,  a  new  heuristic  for 
branch  and  bound  motivated  by  our  results  provides  a  new  tighter  upper  bound  on  the 
worst-case  performance  of  integer  linear  programming  for  solving  cost-based  abduction. 

Most  work  in  cost-based  abduction  and  related  problem  domains  such  as  weighted 
abduction  [11]  have  been  mainly  empirical  in  nature.  However,  the  results  from  experi¬ 
mentation  have  indicated  general  characteristics  from  the  problem  domain  that  allow  for 
more  efficient  computations.  This  work  has  strived  to  provide  a  theoretical  explanation 
for  such  an  observation. 
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